Personalized Language Models for Computer-mediated Communication
Authors
Abstract
In this paper, we investigate the performance of statistical language models on Instant Messaging (IM) data. Language models (LMs) are useful for modeling text data, and hence they help in tasks such as spelling correction, speech recognition, and part-of-speech tagging. Building an LM on a user's past messaging data is one strategy for modeling her writing style, and that LM can then be used to predict the next word in her future communications. However, we hypothesize that a user follows a specific pattern of communication with each of her virtual acquaintances. As a consequence, an LM built on her entire messaging history would degrade the performance of the next-word predictor when she is communicating with a specific person. We therefore apply a method that excludes some specific message contents from the entire history when building the LM. Our method suggests that, when the user is communicating with a specific person, a dedicated LM should be selected from a set of models to increase accuracy. We analyze the IM data of a set of users and show that our method performs well in terms of perplexity.
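The idea of choosing a recipient-specific LM over one trained on the whole history can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the abstract does not say which model family, smoothing, or exclusion rule the authors use, so the add-one smoothed bigram model, the `BigramLM` class, and the toy messages below are hypothetical stand-ins rather than the paper's method or data.

```python
import math
from collections import Counter


class BigramLM:
    """Add-one smoothed bigram model (illustrative choice, not the paper's)."""

    def __init__(self, messages):
        self.context_counts = Counter()  # how often each word appears as a context
        self.bigram_counts = Counter()   # how often each (previous, current) pair occurs
        self.vocab = {"</s>"}
        for msg in messages:
            tokens = ["<s>"] + msg.lower().split() + ["</s>"]
            self.vocab.update(tokens[1:])
            for prev, cur in zip(tokens, tokens[1:]):
                self.context_counts[prev] += 1
                self.bigram_counts[(prev, cur)] += 1

    def prob(self, prev, cur):
        # One extra vocabulary slot is reserved for all unseen words.
        v = len(self.vocab) + 1
        return (self.bigram_counts[(prev, cur)] + 1) / (self.context_counts[prev] + v)

    def perplexity(self, messages):
        log_sum, n = 0.0, 0
        for msg in messages:
            tokens = ["<s>"] + msg.lower().split() + ["</s>"]
            for prev, cur in zip(tokens, tokens[1:]):
                log_sum += math.log(self.prob(prev, cur))
                n += 1
        return math.exp(-log_sum / n)

    def predict_next(self, prev):
        # Most frequent word observed after `prev`, or None if `prev` is unseen.
        seen = {cur: c for (p, cur), c in self.bigram_counts.items() if p == prev}
        return max(seen, key=seen.get) if seen else None


# Hypothetical toy history: one user's messages grouped by recipient.
history = {
    "alice": ["see you at the lab", "see you at the lab tomorrow", "meet me at the lab"],
    "bob": ["wanna grab pizza tonight", "wanna play football tonight",
            "pizza tonight sounds great"],
}

# One LM over the entire history versus one LM for a single recipient.
global_lm = BigramLM([m for msgs in history.values() for m in msgs])
alice_lm = BigramLM(history["alice"])

new_message_to_alice = ["see you at the lab"]
ppl_global = global_lm.perplexity(new_message_to_alice)
ppl_alice = alice_lm.perplexity(new_message_to_alice)
print(f"global LM perplexity:    {ppl_global:.2f}")
print(f"recipient LM perplexity: {ppl_alice:.2f}")
```

On this toy data the recipient-specific model assigns lower perplexity to a new message addressed to the same recipient, which is the direction of the effect the abstract hypothesizes. The example is far too small to count as evidence; it only shows how the two models would be built and compared.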
Similar resources
Learning Pragmatics through Computer-Mediated Communication in Taiwan
This study investigated the effectiveness of explicit pragmatic instruction on the acquisition of requests by college-level English as a Foreign Language (EFL) learners in Taiwan. The goal was to determine first whether the use of explicit pragmatic instruction had a positive effect on EFL learners’ pragmatic competence. Second, the relative effectiveness of presenting pragmatics through two deli...
IMPACT OF SYNCHRONOUS COMPUTER-MEDIATED COMMUNICATION ON EFL LEARNERS’ COLLABORATION: A QUANTITATIVE ANALYSIS
For the last two decades, computers have entered people’s lives in an unprecedented manner in a way that almost everybody considers life without them rather impossible. In recent years, researchers and educators have been trying to discover how computers and the Internet technology can maximize the quality of language instruction. As such, the present experimental study sought to investigate th...
The Effect of CMC in Business Emails in Lingua Franca: Discourse Features and Misunderstandings
The paper argues that everyday exchange of business emails produces a development in the work-group relationship, which, in turn, makes new communication styles possible and acceptable by the users' habit to computer-mediated forms, even in unbalanced professional exchanges. The focus is on the (spoken) discourse features of email messages in a self-compiled corpus of selected computer-mediated...
L2 Learners’ Enhanced Pragmatic Comprehension of Implicatures via Computer-Mediated Communication and Social Media Networks
Second or foreign language (L2) learners’ development of interlanguage pragmatic (ILP) competence to understand and properly interpret utterances under certain social and cultural circumstances plays a pivotal role in the achievement of communicative competence. The current study was designed to explore the effects of synchronous computer-mediated communication (SCMC) and asynchronous com...
Politeness in Emails Exchanged between English and Persian Speakers
Nowadays, intercultural communication via email among various groups and societies has been increasingly important as an aspect of communication. This research aims at investigating aspects of politeness meaning negotiation via emails exchanged between English and Persian speakers with different cultural backgrounds. The present study also reveals the potentials for using emails to experience c...